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Measuring instruments that are used to determine an individual 1 8 level of 
performance on a psychological or educational trait are seldom truly unidimen- 
sional. Certainly, tests based on number series or vocabulary knowledge may 
approximate unidimensional measures; but even these narrowly focused tests usu- 
ally measure more than one trait (see, e.g., Holzman, Glaser, & Pellegrino, 
1980). 

Alternatively, many tests are purposely designed to measure more than one 
trait. The English Usage Test from the ACT Assessment battery, for example, 
measures skills in punctuation, grammar, sentence structure, diction and style, 
and logic and organization (American College Testing Program, 1980). These top- 
ics have been included in the test in order to assess more thoroughly the skills 
acquired in high school English than could be obtained from a unitary measure. 
There is also a statistical motivation for constructing a test that measures 
more than one dimension. To maximize a tes^s ability to predict a criterion 
measure, the test should have items that have a high correlation with the crite- 
rion but low intercorrelations among themselves (Lord & Novick, 1968). Follow- 
ing this selection rule results in a test that measures many dimensions. 

The fact that measuring devices seldom measure single dimensions has seri- 
ous consequences for the application of item response theory (IRT) to test data. 
A basic assumption of most of the IRT models currently being applied is that the 
measuring instrument measures a single trait (Lord, 1980; Lord & Novick, 1968). 
To the extent that this assumption is violated, these IRT models may not be ap- 
propriate. Since tests seldom measure single dimensions, the unidimensional IRT 
models are only applicable if they can be shown to be robust to the violation of 
the unidimensionality assumption or if the items in a test can be sorted into 
subtests that measure a single dimension. 

The issue of the robustness of two IRT models — the 1-parameter and 3-para- 
meter logistic models — to violations of the unidimensionality assumption has 
been addressed by Reckase (1977). He found that even when the proportion of 
variance accounted for by the dominant dimension was as low as 20%, the two mod- 
els still resulted in reasonable ability estimates. However, since these esti- 
mates were of the dominant dimension, much information was lost about other 
traits being measured by the other dimensions in the tests. On the other hand, 
if the measuring instrument in question measured several traits with equal em- 
phasis, the meaning of the ability estimates was difficult to define. Thus, 
although the models do seem to be somewhat robust to violations of the unidimen- 
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sionality assumption, it is at the cost of lost information or poorly defined 
traits. It would seem that a better approach would be to estimate the ability 
on each dimension separately. 

Two different alternatives exist for obtaining estimates of the traits mea- 
sured by a test when more than one trait is being measured. First, the items in 
the test may be subdivided into groups of items that are sensitive to differ- 
ences on one of the dimensions. This procedure breaks down the test into a se- 
ries of unidimensional subtests. Unfortunately, no procedure exists that ade- 
quately performs this function when dichotomously scored test items are used 
(Reckase, 1981). Factor analysis is the procedure most commonly used for sort- 
ing items, but factor analysis suffers from several problems due to the choice 
of the correlation coefficient, the effects of guessing, and the determination 
of the number of factors (Kim & Mueller, 1978). Therefore, in many cases the 
formation of unidimensional subsets of items is not a reasonable approach. 

The second possible approach for obtaining estimates of the abilities on 
each dimension is to develop a multidimensional model of performance that re- 
lates dichotomous item responses to the magnitude of ability on each trait. 
Several models of this type have been presented in the IRT literature (Bock & 
Aitkin, 1981; Mulaik, 1972; Rasch, 1961; Samejima, 1974; Whitely, * ?80) , but 
little work has been done using these models in an applied testing acting. In 
fact, little research has been done to determine the characteristics of these 
models or the properties of the ability estimates obtained through their use. 

From this discussion it should be evident that obtaining estimates of trait 
levels from a test that measures moire than one trait is a difficult problem. 
Traditional models such as factor analysis and nonmetric multidimensional scal- 
ing are not well suited for use with dichotomously scored test items, and most 
of the IRT models that are designed for use with dichotomous test data assume a 
unidimensional test. Hie use of multidimensional IRT models may be the solution 
to this problem; however, little work has been done to demonstrate their useful- 
ness. The purpose of this paper is to review the existing multidimensional IRT 
models and to show how one of the models can be applied to the estimation of 
abilities from a test measuring more than one dimension. 

Definition of the Problem 

Most of the IRT models currently in use assume that the test being analyzed 
measures a unidimensional latent trait. This means that all persons having the 
same amount of the trait, 6, should have the same probability of a correct re- 
sponse to a dichotomously scored item. If individuals with the same level of 
the single trait have different probabilities of a correct response to a test 
item, this implies that at least one other trait is involved in responding to 
the item. If only two dimensions are required in the solution of the item, then 
all persons that have the same values on these two dimensions, [6i, 62], should 
have the same probability of a correct response. Again, if the examinees have 
different probabilities of a correct response, at least one more dimension is 
indicated. Once the number of dimensions, ri, is determined that results in a 
constant probability of a correct response for all persons with the same set of 
abilities, 8^, 62 » •••» 6 n , the size of the complete latent space has been de- 
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fined. Thia concept is discussed in more detail by lord and Novick (1968). 

Note that this method of defining the size of the complete latent space 
emphasizes the ability dimensions of the examinees while treating the test item 
as a constant stimulus. No information is given concerning the characteristics 
of the items. In order to determine the characteristics of the test items, 
critical features of the surface describing the relationship between the proba- 
bility of a correct response and a person's position in the 6 space must be de- 
fined. Two such features that are typically used in IRT are (1) the difficulty 
of the item (location of the point of inflection of the item characteristic 
curve, or ICC) and (2) the discriminating power of the item (related to the 
slope of the ICC at the point of inflection). If the relationship between the 
probability of a correct response and a person's location in the 6 space can be 
described by a sufficiently well-behaved mathematical function (e.g., the logis- 
tic function), the concept of difficulty and discrimination can be generalized 
to items that measure more than one dimension in the complete latent space. 

Suppose that the relate nship between the probability of a correct response 
to a dichotomously scored test item and a parson f s location in the 6 space is 
given by a function that is monotonically increasing for all 8 dimensions and is 
asymptotic to 0 and 1 as each 8 + -» r,rvd 8 « t respectively. That is, 

fWy) < f(8 lk ) i - 1, n , [1] 

if 6 A < 6 ik for all i^, j, and Jt, wiere _i indicates the dimension and and Jc 
indicate the person, and 

t{9 ) + 0 
as 9 ij * ' 

f(e tj ) + i 

as 8^ » , and 

1 > f (8^) > 0 for all i, .1 . 

Then, the difficulty of the item can be % defined as the values of 8 for which 

d 2 £(9) [2] 
— — - 0 , 

de 2 

if certain regularity conditions hold. This is the multivariate equivalent to 
the point of inflection of the univariate ICC. 

For some functions, f(8), the second derivative will be undefined; and for 
others, Equation 2 will yield multiple solutions. However, for a class of mod- 
els based on the logistic function, Equation 2 gives a solution that defines a 
difficulty function rather than a difficulty value for an item. This function 
is the locus of points in the 8 space that yields a .5 probability of a correct 
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response to the item. An example of the difficulty function for an item that 
measures two dimensions may help clarify this concept. 

Suppose that the complete latent space is defined by two dimensions, 6j and 
82, and that the relationship between the probability of a correct response to 
an item and the values of 8 are given by the following function: 



where aj, 02, and o 3 are values related to the shape of the probability surface 
for this particular item. An example of the probability surface is given for 0\ 
- 1.5, o 2 ■ .5, and o 3 « .65 in Figure 1. The difficulty function for this item 
is defined as 

<5 2 P(x « l|8, c) 



The second derivative is only taken with respect to 81 in this case because the 
points of inflection define the same function in both dimensions. If for sim- 
plicity, P is used in place of P(x « 1 | 8, o), the second derivative is equal to 

— = oi 2 P(l - 3P + 2P 2 ) , f5] 
68 2 PJ 

If this expression is set equal to zero and solved for P, three solutions 
result— -0, .5, and 1. Since 0 and 1 are degenerate cases where 8 » +<», the dif- 
ficulty function is defined as the intercept of the probability surface with the 
.5 plane (a plane parallel to the 8 plane at P ■ ,5) 

The line of intersection of the .5 plane with the probability surface can 
be obtained by determining which values of 8 result in a .5 probability of a 
correct response. Since the exponent in Equation 3 must be equal to zero to 
obtain a probability of .5, the appropriate values of 8 are the solutions to the 
equation 

018! + a 2 8 2 + a 3 » 0 . j 6 j 

This is the equation of a straight line in the 8 plane. In the usual linear 
form, the equation becomes 



P(x » l|6, q) » 



e (o 1 e 1 + o 2 e 2 + c 3 ) 



[3] 



+ e (ai8! + o 2 8 2 + 03) • 




[7] 



This line is shown as a dashed line on the .5 plane in Figure 1. Thus, for this 
example, the difficulty of the item is defined as a linear function instead of a 
single value. 
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Figure 1 

An Example of a Two-Dimensional Probability Surface 




The difficulty of the item in the usual latent, trait sense can be deter- 
mined on each dimension by holding the ability on the other dimension constant; 
for example, the point on the 8 2 scale that yields a ,5 probability of a correct 
response when 8i ■ 0 is -03/02* which is equal to -(-•650/1.5) ■ .43 for the 
surface given in Figure 1. Similarly, when 6 2 "0, 61 ■ -03/01 * -(-.650/.5) « 
1.3. Note that these "conditional difficulties" are different for each dimen- 
sion even though there was only one summative term in the model, 03. When the 
dimensionality of the latent space is greater than two, the difficulty function 
for an item defines a hyperplane if a logistic model is used to describe the 
probability surface. 

The definition of the discriminating power of the item in a multidimension- 
al space can also be generalized from that used in the unidimensional case. In 
the unidimensional case, the discriminating power of an item is a function of 
the slope of the ivfim response function at the point of inflection. The dis- 
criminating power of the item in a multidimensional space can likewise be deter- 
mined by calculating the slope of the item response surface along the line de- 
fining the difficulty of the item. 

This slope can be determined by evaluating the first derivative of the item 
response surface for values on the difficulty line. In the example given above, 
the first derivative of the item response surface with respect to Q\ is 
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9P_ 
30, 



= 0^(1 - P) 



(a 1 6 1 + a 2 6 2 + a 3 ) 



1 + e 



(a 1 6 1 + a 2 6 2 + a 3 ) 



1 + e 



i 

(a 1 e 1 + o 2 e 2 + a 3 ) 



• [8] 



If this function is evaluated for points on the line 8 2 * (-01/02)81 - o 3 , the 
result is .25o\ for all values of e^. Thus, 0\ is a discrimination parameter 
for the first dimension. Since the model is symmetric with respect to the 8 f s, 
the derivative with respect to 62 results in .25a 2 when evaluated at all values 
on the difficulty line. Thus, both the difficulty function and the discrimina- 
tion parameters are defined by the a parameters in the exponent of this model. 
As with the difficulty function, the discrimination of a multidimensional item 
can be generalized to many dimensions. In the general case, the discrimination 
with respect to a dimension is the slope of the item response surface at the 
difficulty hyperplane. This slope may be a function of the 9 vector in more 
complex models. 

Up to this point, the complete latent space has been defined and the con- 
cept of an item response surface (IRS) has been introduced. Extensions of the 
unidimen8ional concepts of difficulty and discrimination have also been defined 
for the multidimensional IRS. The goal of this research, however, was to esti- 
mate the amount of ability an examinee possesses on each of these dimensions in 
the multidimensional latent space. Before this goal can be attained, two steps 
must be completed. First, a reasonable and convenient form for the IRS must be 
selected; and secondly, the parameters of the item response surface for each 
item must be determined. 



Multidimensional Latent Trait Models 



A number of models already exist in the literature for approximating the 
IRS. Each of these models will now be described, and the characteristics of the 
surface that they define will be summarized. Subsequently, one model will be 
selected for further analysis, leading to the estimation of model parameters. 

Rasch's Model 



The first of the models to be produced for approximating the IRS in a mul- 
tidimensional space was presented by Rasch (1961). Although the model was not 
specifically designed to represent multidimensional data, Kasch indicated that 
vectors could be used for item and person parameters, thus extending the model 
to the multidimensional case. 



The general form of the model is given by 



(n n n n \ 

E w 4o 9 o + E u 4« a « + E £ v. fl 8 fl a + z. ) 
. £ U 1 I il 1 I m Um 1 m V 

(n n n n \ * 

E w 4o Q fl + E u.x + E E v. 0 8 ft a + z. J 



P(x » iig,9) = ^— — — x , [9] 



k 

E e 
i 
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where 

x is one of i - 1, k item responses; 

6 is the person parameter vector with elements 8^, & " 1, •••> n; 
o is the item parameter vector with elements c^, I m 1, n; 
w, u f and v are weights for the person and item dimensions; 

7 is a scaling constant for the item responses; and 
n is the number of dimensions. 

The model can be shown more conveniently in vector form as 
(W *6 + U^O + + z i ) 

P(X " a k (V0 +0^0 + 9^0 + Zi ) 1 [101 

E e 
i 

where W and U are vectors of weights for each item response, and V is a matrix 
of weights. 

This model is extremely general, allowing both dichotomous and polychoto- 
mous scoring and containing both the 1-parameter and 2-parameter logistic models 
as special cases. Because the model is so general, it is difficult to determine 
the form of the item difficulty and discrimination functions. However, for the 
special case of a dichtothously scored item with W and U equal to unit vectors 
for a correct response and zero vectors for an incorrect response, V equal to 
the identity matrix for a correct response and a zero matrix for an incorrect 
response, and £ equal to zero for all responses, the difficulty function is a 
hyperplane and the conditional slopes of the surface where it intersects the .5 
plane are functions of o±. The model presented in Equation 3 and shown in Fig- 
ure 1 is a special case of the general model. 

Only one study is known that uses this general model to represent multidi- 
mensional item response data (Reckase, 1972), although there have been other 
applications of the model (Andersen, 1982; Andrich, 1978). In the Reckase 
(1972) study an attempt was made to estimate the parameters using a least 
squares procedure for a special case of this model, where is a zero matrix 

for all responses and z^ takes on zero values for all responses. The attempt 
was not entirely successful, however, in that the fit of the multivariate model 
to multivariate data was no better than the fit of the simple Rasch (1960) model 
to the same data when estimates of the parameters were used. The poor results 
were attributed to two factors: (1) both the parameter vectors and the wights 
were estimated from the data and (2) the sample size used was too small to esti- 
mate accurately the large number of unknown variables. The parameter estimates 
were interpretable, however, suggesting that a less ambitious approach might be 
fruitful. 

Mulaik's Model 

Another multidimensional model that was developed as an extension of the 
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work of Rasch (1960) was proposed by Mulaik (1972). This model is given by 
n 

' Vi 

P(x - i|y,») - , [nl 

1 + 2 Yj^i 
i-1 1 1 

where the y^s are item parameters and the tt^s are person parameters for the 
interaction of a person and an item in an n-dimensional space. 

The previous definitions of the item difficulty and discrimination do not 
apply to this model, since the surface defined by Equation 11 does not have a 
point or line of inflection. However, the intersection of the surface with the 
.5 plane is a hyperplane and could be used to define item difficulty. Unlike 
the previous model, the conditional slope of the IRS at the intersection with 
the .5 plane is not simply a function of the item parameters but also depends on 
the ability parameters. A two-dimensional example of the response surface de- 
scribed by this model is presented in Figure 2. 

Figure 2 

Item Response Surface for Mulaik 1 s Model 




Mulaik (1972) presented a maximum likelihood procedure for estimating the 
parameters of this model, but it appears that it has not ever been applied. He 
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cautioned that the amount of computation and the constraints required to esti- 
mate the parameters may be too great for the current generation of computers. 

Sympson's Model 

A third model that has been developed to describe the interaction of a per- 
son and an item in a multidimensional latent space was described by Sympson 
(1978). Rather than extend the 1-parameter logistic model, as done by Rasch 
(1961) and Mulaik (1972), Sympson (1978) based his model on an extension of the 
3-parameter logistic model (Birnbaum, 1968). The mathematical expression for 
this model is given by 



P(x » l|e,a,b,c) - c+(l-c)Il l + e 

1-1 L J 



[12] 



where 

x is the item response, 

6 is a vector of ability parameters, 

a is a vector of discrimination parameters, 

b is a vector of difficulty parameters, and 

c is a pseudo-chance level parameter. 
An example of the surface defined by Equation 12 is given in Figure 3 for the 
two-dimensional case with parameters c - .2, aj - .7, a 2 ■ 1.2, bj » -.6, and b2 
■ • 5. 

Unlike the models presented by Rasch (1%1) and Mulaik (1972), the root of 
the second derivative of this equation does not deline a difficulty function but 
gives a single value for each dimension. This value is simply the b parameter 
for that dimension. The difficulty of an item using this model can therefore be 
defined as the vector of b values, which defines a point in the multidimensional 
space. 

The slope of the IRS at the point of inflection for Sympson's (1978) model 
is given by 

(l-c)(1.7) n L 



which is solely a function of the item discrimination parameters and the pseudo- 
chance level parameter. If the slope of the function is determined at the dif- 
ficulty point with respect to just one dimension, 8^, the result is (1 - c) 

(1.7)a i /4x2^ n " 1 ^ , where _n is the number of dimensions. Thus for this model the 
& vector defines the discrimination power of the item. 

Sympson (1978) has done some preliminary work on estimating the parameters 
of this model for some simple cases, but no procedure has yet been published for 
the full multidimensional case. Lord (1978), in discussing Sympson's (1978) 



10 



- 160 - 



Figure 3 

Item Response Surface for Sympson f s Model 




paper, has suggested that a Bayesian or maximum likelihood procedure might be 
more fruitful than the method Sympson proposed. However, these methods have not 
been developed to the point where this model can be applied to actual test data. 

Bock and Aitkin's Model 

Bock and Aitkin (1981) suggested a multidimensional latent trait model that 
is an extension of the 2-parameter normal ogive model (Lord & Novick, 1968). 
They also indicated that this model is similar to the factor analysis procedures 
developed for dichotomous data by Christof fersson (1975) and Muthen (1978). 

The mathematical form of the Bock and Aitkin (1981) model is given by the 
equation 



P(x = l|9,c,c) =* 



<*!>* I 



t 2 > 



dt 



[13] 



where 



z(e) - c + eJ^ a i e i , 

6 is a vector of ability parameters, 

a is a vector of discrimination parameters, and 

_c is the item difficulty parameter. 
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Due to the similarity of the cumulative model and logistic functions, this 
model is very similar to one of the special cases of the general Rasch model 
given in Equation 9. Specifically, it corresponds to the case where the expon- 
ent of that model is equal to Z 0.8 .+ a for a correct response and 0 for an 

4 ii nri 
incorrect response. 1 

As with the general Rasoh model, the difficulty function for the model is 
defined by the equation for a hyperplane 



The conditional slope of the IRS for points on the difficulty function is given 
by &±hj2x> demonstrating that the £ vector is related to the discriminating 

power of the item. Since the two-dimensional IRS for this model is indistin- 
guishable from the logistic surface presented in Figure 1, an example is not 
presented for this model. 

Bock and Aitkin (1981) have produced an estimation procedure for their mod- 
el based on the marginal maximum likelihood technique. They have applied thi3 
procedure *c data assuming a two-dimensional solution. The results of the anal* 
ysis showed that different quadrature procedures used in conjunction with mar- 
ginal maximum likelihood techniques gave slightly different results but that a 
two-dimensional solution seemed to fit i:he data fairly well. No other applica- 
tions of this model are known. 

Samejima 1 s Model 

Samejima (1974) presented a more general version of the model suggested by 
Bock and Aitkin (1981) as a special case of her continuous response model in a 
multidimensional latent space. Her model is given by the equation 



where 

z is the point of dichotomy of the continuous trait measured by this item, 

o 

P* (0) is the probability of a correct response, 

a is a vector of discrimination parameters, 

9 is a vector of ability parameters, 

b- is a vector of difficulty parameters, and 

^-(u) is a twice 'if ferentiable function. 
8 

When is the normal density function, Equation 15 is identical to Equation 13 

o 

with c « -Ea^b^. Wien </> g is defined as the logistic density function, the model 

is a special case of the general Rasch model given in Equation 9. Samejima also 
points out the similarity of the model to linear factor analysis. 




[14] 




[15] 
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Whitely' s Models 

One other class of multidimensional latent trait models exists in the lit.-* 
erature, but this class of models was developed from a different perspective 
than the others. The models presented up to this point generally consider the 
dimensions required to determine the complete latent space as unknown hypotheti- 
cal constructs, the properties of which need to be discovered. In contrast, the 
class of the models proposed by Whitely (1980) considers the dimensions to be 
components in a cognitive model of performance. These dimensions are defined in 
advance of being estimated as particular cognitive processes. 

The mathematical form of the model used by Wiitely is similar to that used 
by Sympson (1981) in that it is composed of the product of separate logistic 
model terms. The particular model, called the multicomponent latent trait mod- 
el, is given by the equation 



where the variables are as defined above. This model is the same as the Sympson 
(1978) model with c ■ 0 and ai ■ 1/1.7 for all i. Since it is a special case of 
the Sympson model, it also has a difficulty function equal to the b vector, and 
the slope at the point defined by the Jb vector is l/4 n . Whitely (T980) has also 
produced more complex versions of this model, but they are all composed of com- 
binations of the 1-parsmeter logistic model. 

Whitely (1980) has developed procedures for estimating the parameters of 
this model, but the estimation has been performed in a different manner than the 
other models that have been described. Whereas the estimation procedures for 
the other models have attempted to estimate the vector parameters from the di- 
chotomous responses to test items, Vhitely has developed an experimental design 
for collecting responses on each cognitive component separately. The parameters 
of each of the product terms of the model are then estimated separately using 
procedures developed for the unidimensional Rasch model. No descriptions of a 
procedure for simultaneous estimation of all of the parameters of the model has 
been found in the literature. 



An analysis of the six models that have been described above indicated that 
these models fall into three basic classes. The first of these classes (Class 
I) is of the form 



where z(8) is a linear function of the elements of 9. The general Rasch (1961) 
model, che Bock and Aitkin (1981) model, and the Samejima (1974) model fall into 



p(x = i|e,b) =* n 
i-1 



e 1 1 

(e - b 

1 + e 1 1 



[16] 



Comparison of the Multidimensional Models 




[17] 
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this class* All of these models allow high ability on one dimension to compen- 
sate for low ability on another dimension, resulting in \rtiat Sytapson (1978) has 
labeled as compensatory models. All of these models have linear difficulty 
functions when used with dichotomously scored data and have conditional slopes 
at points on the difficulty function that are functions of the corresponding 
discrimination parameters. These models are fairly simple from a mathematical 
point of view (especially wh*n ^(u) is the logistic density function), and esti- 
mation procedures have been developed for the parameters (Bock & Aitkin, 1981). 

Class II models contain a single example, the model proposed by Mulaik 
(1972). This model is of the form 



P(x » 1|y,tt) 



n 

i Vi 
i=l 



1+ I Yj^j 

i=l 



[18] 



where the variables are as defined earlier. This model is also compensatory in 
Sympson 9 8 (1978) sense, but it is unlike the previous class in that the ability 
metric is defined from 0 to +» instead of from -» to This results in an IRS 

that does not have a point or line of inflection. If the difficulty of the item 
is defined by the intersection of the item with the .5 plane, the result is a 
linear function similar to that for the Class I models. The slope of the IRS at 
the difficulty function has the property of changing with its position relative 
to the ability dimensions. Mulaik (1972) has proposed an estimation procedure 
for this model, but there have been no studies to determine its practicality. 

Class III models contain the models proposed by Sympson (1978) and Whitely 
(1980). These models take the form 

n 

P(x = l|6,o) = a + (1 - o ) n P%(8J , [19] 

i=*l 1 1 

where is a lower asymptote parameter and PjCSj) Is the probability of re- 
sponse with respect to a specific dimension. In Uhitely's (1980) model the di- 
mensions are defined as specific cognitive processes required to solve the prob- 
lem proposed in the item, and the parameter is assumed to be zero. In Symp- 
son 1 8 (1978) model the dimensions are hypothetical traits based on commonalities 
among items. 

Unlike the previous models, the Class III noncompensatory models do not 
allow a high ability on one dimension to compensate for a lower ability on an- 
other. The lowest of the values of ^£(6^) defines the upper bound of P(x » 

l|§»g)» Although some work has been done on the estimation of parameters for 
the Class III models, no generally accepted algorithm for estimation of the pa- 
rameters of these models is known to exist. 
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Several issues need to be considered in selecting one of these models as a 
description of the interaction between a person and an item. The first is 
whether the model is realistic. This depends on whe aer a compensatory or non- 
compensatory model is appropriate for actual persons and items. Unfortunately, 
this is a question that still needs to be answered, based on research in cogni- 
tive psychology. Although sufficient information is presently not available, 
the applicability of the models to actual testing situations may provide an an- 
swer. 

Beyond questions of the psychological meaningf ulness of the models are 
questions of practicality. The most well developed estimation procedures are 
available for the Class I models; and these models tend to have the most flexi- 
ble options duo to the characteristics of the exponential term. As a conse- 
quence, the Class I models tend to be more promising than the other models. Of 
the Class I models, the generalized Rasch model has the greatest flexibility in 
its options and is the most mathematically tractable. The remainder of this 
paper will therefore concentrate on the properties of this model and the proce- 
dures for the estimation of its parameters. 

Application of the General Rasch Model 

Although the general Rasch model is a generalization of the l-p*rameter 
logistic model, a very simple model, in its most general form the model is very 
complex. A study (McKinley & Reckase, 1982) was thus undertaken to determine 
whether a less complex formulation of the model would be adequate for modeling 
multidimensional response data. ) 

Method 

Design . The general design of this study was to first evaluate the proper- 
ties of a simple formulation of the general model and then to evaluate increas- 
ingly more complex versions of the model produced by inserting additional terms. 
The initial form of the model investigated is given by Equation 20: 

p(x|? r ? i ) = vcgjigt) exp( ^' ? i + "V * 1201 

For each level of model complexity, the properties of the model were investigat- 
ed and the reasonableness and usefulness of the model were explored. This was 
done primarily by generating simulated test data to fit the particular form of 
the model being investigated and by analyzing that data in an attempt to assess 
how well the characteristics of the data matched the characteristics of real 
test data. If it were found that a particular form of the model could not be 
used to generate realistic data in terms of either dimensionality or item char- 
acteristics, then that form of the model was rejected and a different form of 
the model was investigated. Distinct special cases of the model were obtained 
by eliminating different terms from the general model by setting the appropriate 
parameter weights equal to zero. 

Analyses . The analyses of the generated data that were performed included 

; 
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factor analysis and traditional item analysis. The purposes of the analyses 
were three-fold. One purpose was to determine whether the obtained factor 
structure of the data resembled the factor structure typically obtained for real 
test data. The second purpose was to determine whether the obtained unidiraen- 
sional item characteristics (difficulty and discrimination) were similar to 
those obtained for real data. The third purpose was to aid in the interpreta- 
tion of the parameters of the model. 

If it were found that a model could not be used to generate realistic data, 
an attempt was made to determine what changes in the model would yield a more 
acceptable model. In many cases it ras necessary to generate additional data, 
using different values for the parameters of the model in order to answer spe- 
cific questions about a particular model statement. Once an understanding was 
gained as to the roles played by different parameters of the model, predictions 
could be made regarding the effects of adding or eliminating other terms. 

Results . As a result of the analyses performed on the different formula- 
tions of the model, a good understanding of the significance of the terms in the 
model was gained* It is now clear that parameters play quite varied roles de- 
pending on the term of the model in which they appear. Because of this, the 
characteristics of the data> for which the model can be used vary markedly, de- 
pending on the form of the model. 

To begin with, it is dlear that the use of W'gj and U'c^ terms alone is not 

sufficient for modeling multidimensional response data. The linear composite 
represented by the U 1 ^ term in the model determines only item difficulty. 

Moreover, the order of the vector is unimportant. It is the magnitude of the 

inner product of the item parameter vector and the weight vector that determines 
the difficulty of the item. Regardless of whether the vectors have one or five 
elements, as long as the inner product is the same, the difficulty of the item 
in terms of proportion of correct responses is the same. 

It is also clear from the results of the analyses that the product term 
8jVa^ is necessary if item discrimination is to be modeled. When data are gen- 
erated using only the inner product terms, the items modeled have constant dis- 
criminations and the resulting data are unidimensional. When the product term 
is included, the items modeled have varying discrimination. Moreover, the fac- 
tor analysis results indicate that the dimensionality of the generated data is 
determined by the number of elements from the o± vector used in the QfV^ term. 

However, it should be emphasized that if the V matrix contains more than one 
nonzero element in a row or column, a a or 9 term will appear in the exponent 
multiplied by more' than one of the 8 or a parameters, respectively (e.g., 8]Ci + 

or 8jOj + © 2 a 1 ^ * T ^ e P re8ence °f these terms in the exponent results in 
difficulty in determining the meaning of the 8 and a vectors. 

The elements in the vector in the 5iY?i term determine the discrimina- 
tion of the modeled items. Because of this, use of the same elements of the o± 
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vector in both the BjVc^ term and the U t g 1 term produces an undesirable charac- 
teristic in the data* Since the elements used in the term determine item 
discrimination, while the elements in the U*^ term determine item difficulty, 

use of the sane elements in both terms yields items having highly related ob- 
served item difficulty values and item discrimination values* This is not a 
very realistic situation* 

Conclusions . On the basis of the results of the analyses, it was concluded 
that if the model is to be used to represent multidimensional item response 
data, it must include the 8JVoj term, but no element in either the or the 8^ 

vector should be multiplied by more than one term in the other vector* If items 
are to vary in difficulty, the U , a i term must be included; but to avoid highly 

related values for unidimensional measures of item discrimination and difficul- 
ty, no element of the vector should appear in both the l^g^ term and the ©jV^j 

terra. The model that appeared to be most useful for modeling multidimensional 
response data is given by 

where no elements of the vector appear in both terms of the model* 

There was one additional significant finding* Although it was concluded 
that the use of the model without the SjVa^ term was unsuccessful in modeling 

multidimensional response data, this result was obtained when the special case 
of the model was applied to dichotomously scored item response data. This model 
may also be applied to polychotomously scored item response data. In one spe- 
cific application of this model to polychotomous response data, some measure of 
success was attained in modeling multidimensional data using only the V'oi and 

W f 9j terms. Dichotomously scored items were transformed to polychotomous form 
by grouping items together to form clusters having several nominal response cat- 
egories. When these data were analyzed, several dimensions could be determined. 
However, this approach has not been extensively investigated, and any conclu- 
sions drawn as to the usefulness of this approach are at best tentative. 

Estimation of Parameters in the General Rasch Model 

Two basic approaches to estimating item parameters can be distinguished. 
One approach is to specify a distribution of the latent ability of the popula- 
tion from which the sample was taken and then to integrate the response function 
with respect to that distribution to obtain item parameters unconditionally 
(Bock, 1972). This approach has been taken by Bock and Lieberman (1970) and 
Bock and Aitkin (1981). The other approach is to estimate item parameters by 
treating the examinees* abilities as fixed unknowns and by conditioning item 
parameter estimation on estimates of ability (Bock, 1972). This approach has 
been taken by Lord (1968) and Kolakowski and Bock (1970). The present research 
considers both approaches. However, at this time only the conditional item pa- 
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rameter estimation procedure is designed for the most general form of the model; 
it is still limited to the use of dichotomously scored data and it does not es- 
timate the W, U, and V parameter weights. The unconditional item parameter es- 
timation procedure is "designed for use with the form of the model given by Equa- 
tion 21. As was the case with the conditional item parameter estimation proce- 
dure, this procedure does not estimate the parameter weights. 

In addition to the two procedures that have been developed for item parame- 
ter estimation, a maximum likelihood ability estimation procedure has been de- 
veloped for estimating the ability parameters for the general Rasch model. This 
procedure estimates ability conditionally and is combined with the conditional 
maximum likelihood item parameter estimation procedure discussed above to form a 
conditional maximum likelihood estimation procedure for simultaneous estimation 
of the item and ability parameters of the general Rasch model. Ifeed alone, the 
conditional maximum likelihood ability estimation procedure can be used to esti- 
mate ability using the item parameter estimates obtained from the unconditional 
item parameter estimation procedure. 

Unconditional Item Parameter Estimation 

General procedure . The unconditional item parameter estimation procedure 
is an adaptation of a procedure proposed by Bock and Aitkin (1981), which was 
designed for use with a multidimensional 2-parameter normal ogive model (see 
Equation 13). In the initial step of this procedure, a distribution of ability 
is assumed, and quadrature nodes and weights are selected for use in determining 
expected sample sizes for portions of the distribution using numerical integra- 
tion. For the multidimensional case, the prior distribution of ability is mul- 
tivariate, and the nodes and weights are vectors. At each node the expected 
number of examinees from the sample having the ability represented by the node 
is computed, as is the expected frequency of correct responses to each item by 
examinees with the ability represented by the node. These expected number-cor- 
rect scores and expected sample sizes are used in a logit analysis, which is 
performed using a least squares regression procedure. The results of the logit 
analysis are estimates of the parameters of the model. 

The initial stage of the estimation procedure requires provisional esti- 
mates of the item parameters. These provisional estimates are used in the first 
step of the initial stage, which involves obtaining expected sample sizes and 
number-correct scores. In the second step of the initial stage, a logit analy- 
sis is performed to obtain new estimates of the item parameters. These new es- 
timates are used in the first step of the second stage, which involves obtaining 
new estimates of sample sizes and number-correct scores. These new sample sizes 
and number-correct scores are used in another logit analysis, which yields a new 
set of item parameter estimates. These stages are repeated until a criterion of 
convergence is met or until a limit on the number of stages is reached. 

Expected sample sizes and number-correct scores . The expected sample size 
at each node is given by 

M = E 1 . k k » [22] 
£-1 P, 
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where 

is the expected sample size at node k; 

8^ is the ability represented by node k; 

is the weight for node k; 

N is the number of examinees in the sample; and 

1^(8^) is the likelihood of response vector 2, given node Ic. 

L£(e k ) is given by 

VV = V ' t23] 

where P(u^£) is given by 

P( Uil ) « exptu^C^ + A^)] I & + ex Pt c i + A ± § k ]) , [24] 
where 

u^^ is the response to item i^ in response vector 

c± is the difficulty parameter for item jl, and 
A^is given by 

M 

*A " \ a im 9 km , [251 
m=l 

where 

M is the number of dimensions, 
a^ m is the mth element of the discrimination parameter vector for 
item i^, and 

^kxn * 8 t * le S^ 1 e l ement of node k. 
is given by 

*i = j x ■ [26] 

where % is the number of quadrature nodes. The sum over all of the nodes of the 
ratio in Equation 22 is one, and the sum of the N k over all of the nodes is N, 
where N is the number of examinees in the sample. 

The expected number-correct score for item i^ at node k, r , is given by 



ik' 

= E 1& \ k * , [27] 



4-1 P £ 



where the other terms are as previously defined. The sum over all of the nodes 
of the r^ for an item is equal to the observed number-correct score for the 



item. 
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Multiple loglt analysis . For each Item the expected proportion-correct 
score at each node Is given by 

f ik - 7 ik 1 \ • ™ 

where all of the terms are as previously defined. The P^ are converted to log- 
its and used as the dependent variable in a regression analysis. The indepen- 
dent variables in the regression analysis are the elements in the node vectors. 
The model for the regression analysis is given by 

lo8 e [ *ik 1 (1 " *ik )] " C i + *i§k + err ° r 9 [29] 

where all of the terms are as previously defined. The regression analysis re- 
sults in estimates of c^ and 

Conditional Item Parameter Estimation 

This procedure is based on the conditional maximum likelihood estimation 

technique. The procedure begins with the computation of an initial weighted 

item score on each of M dimensions using 

./- 

N N 

X ik = ^ U jk + ^ fl jk V jk ' [30] 
where 

is the initial score for item _i on dimension 1c, 

Uj k is the kth element of U, and 

Vjk is the kth element of V # 
The index j_ indicates that *he value of the elements of U and V are dependent on 
the response of the jth examinee to item I. These scores are converted to jz 
scores via the transformation 

z ik - (x ik - V 1 \ • t311 

where 

z ik * 8 t * ie .5. score for item _i on dimension Jc, 

is as defined in Equation 30, 

7^ is the mean of the weighted scores on dimension Jc, and 

is the standard deviation of the weighted scores on dimension Jc. 

The scores are used as initial estimates of the item parameters. That is, 

5 oik = Z ik , t32] 

where 5 oik is the item parameter estimate of item i_ on dimension k after 0 iter- 
ations . 



ERIC 



20 



New item parameter estimates are obtained using the initial item parameter 
estimates as the starting point in an iterative process. One iteration is com- 
plete when a new item parameter estimate is obtained for each item on each di- 
mension. Within a single iteration, new estimates on the first dimension are 
obtained while holding the estimates on all other dimensions constant at the 
values obtained on the previous iteration. Estimates on the second dimension 
are obtained using previous estimates on all other dimensions except the first 
dimension. For the first dimension the new estimates are used. An iteration is 
complete when new estimates have been obtained on all dimensions. Iterations 
continue until a criterion of convergence is met or until a limit on the number 
of iterations is reached. 



On the 2,th iteration, the new estimate on the jcth dimension for item i is 
given by ~~ 



°Uk = °<i. 



Dik + [i 



ik 



l°g e L0 lk ) 



/ 



3a 



ik 



[33] 



where ^(^-i)^ *s the estimate for item i^ on dimension Jc from the previous iter- 
ation and log L is the log to the base ji of the likelihood function for the re- 
sponse vector for item i. The likelihood function is given by 

N 



L(o ik ) = n p(u„) , 



[34] 



where P(uy ) is the probability of response u^ by person j[ to item ^i with pa- 
rameter o ik . is given by 

exp(W l -g 1 + U 1 -o 1 + 9 i 7 1 o i + 2l ) 

P(u ij =ll) "exptW^ej+U^o^^V^+z^ + expfW^e^^o^^^^] 9 [35] 



where all of the terms are as previously defined. The 0 and 1 subscripts on the 
vectors of weights, U, V, and W, indicate the values taken by those vectors for 
an incorrect and a correct response, respectively. 

3he first derivative of the log e likelihood function is given by 
- N N N N 

where all of the items are as previously defined. The second derivative of the 
log e likelihood function is given by 

aa 4 .- , J-1 
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where is the prjbability of a correct response to item i^ by person « 
1 - Py, and the otvxer terms are as previously defined. 

Conditional Ability Parameter Estimation Procedure 

The conditional ability estimation procedure is also a maximiaa likelihood 
estimation procedure. It is very similar to the conditional item parameter es- 
timation procedure. For each examinee an ^aitisl weighted score is computed on 
each of M dimensions as 



n 

= E w 



i=l 



'jk + 1 f 1 V jk a ik • 



[38] 



where Xj k is the initial score for person j_ on dimension jc. These scores are 
converted to z scores via the transformation 



'jk 
where 



(X 



'jk 



[39] 



is the £ score for person j_ on dimension k, 



Z jk 

is as defined in Equation 38, 

TZ^ is the mean of the weighted scores on dimension k, and 

is the standard deviation of the weighted scores on dimension k. 
The £ scores are used as initial estimates of ability. That is, ~* 



8 ojk 

where 0 
tions. 



'jk 



[40] 



ojk 



is the ability estimate of person^ on dimension k after 0 itera- 



Estimates of ability are obtained using the same iterative procass as was 
described for the conditional item parameter estimation procedure. On the £th 
iteration, the new estimate on the kth dimension for person j is given by 



*jk 



Vl)jk + 



gg— iog e L(e jk ) 



jk 



36 



jk 



[41] 



where 8(a_i)jk lf 3 the estimate for person j_ on dimension 1c from the previous 

iteration and log L is the log to the base e of the likelihood function for the 

response vector for person The likelihood function is given by 
n 



L( V 



n POO , 



i=l 



[42] 
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where p ( u y) i8 the probability of a response u^ to item jL^ by person j_ with 
ability 8j k . PCujj) is given by Equation 35. In evaluating 6j the moat recent 
item parameter estimates are used. 

The first derivative of the log likelihood function is given by 
Sj- log.L^) - *ZV A -U io + y o a l)Qlj - ifi ml + y^Py , [43] 

where all the terms are as previously defined. The second derivative is given 
by 

lO^LC^) - + ?1 + + PyQy , [44] 

where, again, all the terms are as previously defined. 

Evaluaton of the Estimation Procedures 

General Design 

The general approach taken to evaluate the estimation procedures was to 
apply the procedures to test data generated to fit a two-dimensional version of 
the model given by Equation 24 and then to compare the estimates of the parame- 
ters with the values used to generate the simulation data. For this purpose a 
data set comprising response data for 50 items and 1,000 examinees was generat- 
ed. Three parameters for each item and tw parameters for each examinee were 
used to generate these data. The values used for the item parameters are shown 
in Table 1. The examinee ability parametero were selected from a bivariate nor- 
mal distribution with p - 0, y - 0, and I equal to the identity matrix. 

The weight vectors used in this study ware as follows. For an incorrect 
response, all of the weight vectors were set equal to zero. For a correct re- 
sponse, the following matrix and vectors were used: 



W - (0,0) 
U - (1,0,0) 

0 10 
0 0 1 



[45] 
[46] 



[47] 



As can be seen, only the first item parameter was selected by the weight vector, 
U, to act as the item difficulty parameter. The other two item parameters were 
selected by the weight matrix, y, to act as discrimination parameters. The re- 
sulting model is given by 
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Table 1 

True and Estimated Parameters 



True Estimates 



Parameters Unconditional Conditional 



Item 


1 


2 


3 


1 


2 


3 


1 


2 


3 


1 


-.65 


1.50 


.50 


-.51 


1.45 


.44 


-.48 


1.27 


.36 


2 


-1.40 


.50 


1.25 


-1.34 


.48 


1.29 


-1.56 


.45 


1.29 


3 


-.20 


1.35 


.15 


-.02 


1.49 


.40 


-.06 


1.69 


.25 


4 


.40 


1.60 


.55 


.64 


1.67 


.84 


.59 


1.58 


.63 


5 


.00 


.50 


1.15 


.19 


.48 


1.02 


.10 


.51 


1.04 


6 


-1.3G 


.35 


1.05 


-1.31 


.50 


1.17 


-1.68 


.42 


1.25 


7 


.05 


1.45 


.35 


.28 


1.56 


.43 


.34 


1.63 


.35 


8 


.19 


.25 


1.40 


.20 


.27 


1.18 


.25 


.20 


1.23 


9 


-.17 


.85 


.85 


-.02 


1.07 


.87 


-.07 


.89 


.67 


10 


.14 


1.75 


.45 


.40 


1.60 


.60 


.34 


1.45 


.57 


11 


.37 


.60 


.80 


.40 


.66 


1.02 


.29 


.61 


.88 


12 


.87 


1.65 


.65 


.82 


1.49 


.63 


.70 


1.38 


.46 


13 


-.93 


.35 


1.35 


-.98 


.26 


1.58 


-1.00 


.24 


1.65 


14 


1.85 


.65 


1.65 


1.80 


.70 


1.55 


1.59 


.54 


1.29 


15 


.06 


.65 


.65 


.16 


.73 


.66 


.00 


.80 


.52 


16 


-.41 


.45 


1.45 


-.31 


.48 


1.53 


-.38 


.23 


1.60 


17 


-1.54 


.75 


1.25 


-1.41 


.72 


1.32 


-1.55 


.71 


1.14 


18 


.34 


1.55 


.25 


.44 


1.50 


.32 


.39 


1.46 


.29 


19 


-.15 


.65 


1.35 


.14 


.90 


1.38 


.04 


.65 


1.28 


20 


1.48 


1.25 


.45 


1.54 


1.41 


.41 


1.42 


1.32 


.22 


21 


-1.45 


1.65 


.45 


-1.42 


1.69 


.46 


-1.73 


1.80 


.48 


22 


.75 


.45 


1.35 


.75 


.45 


1.42 


.67 


.40 


1.35 


23 


-.75 


.35 


1.55 


-.62 


.33 


1.75 


-.63 


.21 


1.72 


24 


1.10 


1.10 


.30 


.99 


.92 


.37 


.93 


1.11 


.35 


25 


-.55 


1.20 


.15 


-.42 


1.35 


.29 


-.38 


1.37 


.24 


26 


.50 


.50 


1.00 


.44 


.66 


1.03 


.28 


.49 


1.12 


27 


-.15 


1.45 


.45 


.10 


1.36 


.52 


-.02 


1.32 


.42 


28 


.65 


.70 


.70 


.78 


.63 


.66 


.56 


.68 


.86 


29 


-1.00 


1.00 


.30 


-1.02 


1.16 


.37 


-.95 


1.14 


.38 


30 


1.00 


.30 


1.00 


1.18 


.27 


1.27 


.98 


.27 


1.14 


31 


-.25 


.95 


.25 


-.16 


.99 


.20 


-.04 


1.11 


.24 


32 


-.70 


.15 


1.50 


-.65 


.17 


1.46 


-.68 


.03 


1.51 


33 


.85 


1.15 


.45 


1.10 


1.31 


.42 


.98 


1.29 


.30 


34 


.05 


.10 


.95 


-.02 


.13 


.92 


.03 


.20 


1.09 


35 


-.95 


1.35 


.50 


-.79 


1.39 


.64 


-.80 


1.28 


.52 


36 


-1.50 


.20 


1.20 


-1.45 


.36 


1.07 


-1.38 


.47 


1.03 


37 


1.80 


1.55 


.55 


2.16 


1.84 


.35 


2.06 


1.51 


.44 


38 


-2.00 


.15 


1.15 


-1.98 


.32 


1.05 


-2.06 


.30 


1.09 


39 


-.90 


1.40 


.35 


-.70 


1.38 


.22 


-.70 


1.18 


.40 




i nn 
1 • uu 


i nn 
1 • uu 


i nn 
1 • uu 


i n a 
1 • uo 


QO 
• ?Z 


Q A 

• 70 


i n a 
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41 


.15 


1.25 


.70 


.26 


1.33 


.80 


.28 


1.17 


.75 


42 


-1.50 


.25 


.95 


-1.45 


.40 


.85 


-1.60 


.49 


1.14 


43 


-1.25 


.35 


1.45 


-1.02 


.27 


1.42 


-1.13 


.26 


1.30 


44 


1.25 


1.30 


.25 


1.37 


1.25 


.19 


.95 


1.26 


.26 


45 


-2.00 


1.15 


.15 


-1.89 


.99 


.09 


-2.01 


1.30 


.13 


46 


1.75 


.50 


.50 


1.80 


.52 


.59 


1.65 


.68 


.64 


47 


.65 


.65 


1.30 


.82 


.48 


1.39 


.71 


.49 


1.27 


48 


-.25 


1.00 


.45 


-.19 


1.03 


.59 


-.17 


1.03 


.53 


49 


.35 


.55 


1.15 


.40 


.59 


1.14 


.24 


.48 


1.04 


50 


.00 


.95 


.15 


-.05 


.87 


.25 


-.07 


1.11 


.26 
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Results 



Table 1 shows the item parameter estimates obtained from both the condi- 
tional and unconditional item parameter estimation procedures. The estimates 
have be^n scaled to have the same means and standard deviations as the corre- 
sponding true item parameters. The correlations of the estimates with the true 
values are shown in Taule 2. As can be seen, for these data there was very lit- 
tle difference in the quality of the estimates yielded by the two estimation 
procedures. Of course, this comparison is based on simulation data and on only 
one data set. Clearly, more research is needed before any definite conclusions 
about these procedures can be drawn. 

Table 2 

Intercorrelation Matrix for True and Estimated Item Parameters 



Estimation Procedure 



True Unconditional Conditional 



Parameter 


°Tl 


°T2 


°T3 


°U1 


°U2 


°U3 


6 Cl 


6 C2 


d C3 


°T1 


1.00 


.21 


-.12 


.99 


.20 


-.12 


.99 


.15 


-.18 


°T2 


.21 


1.00 


-.75 


.20 


.97 


-.72 


.15 


.96 


-.79 




-.12 


-.75 


1.00 


-. 12 


-.72 


.97 


-. 18 


-.79 


.96 


yi 


.99 


.20 


-.12 


1.00 


.24 


-.13 


.99 


.19 


-.20 




.20 


.97 


-.72 


.24 


1.00 


-.73 


.19 


.96 


-.80 


°U3 


-.12 


-.72 


.97 


-. 13 


-.73 


1.00 


-.20 


-.80 


.97 


°C1 


.99 


.15 


-.18 


.99 


.19 


-.20 


1.00 


.20 


-.22 


! C2 


.15 


.96 


-.79 


.19 


.96 


-. 80 


.20 


1.00 


-.88 


5 C3 


-. 18 


-.79 


-.96 


-.20 


-.80 


.97 


-.22 


-.88 


1.00 



Discussion 

The purposes of this paper were threefold. First, the fundamental concepts 
required when considering multidimensional models for the interaction of a per- 
son and a test item were defined. These concepts included the multidimensional 
latent space, the item difficulty function, and the item discrimination func- 
tion. These definitions were conceived as multidimensional generalizations of 
similar concepts in unidimensional IRT models. Second, six existing multidimen- 
sional models were reviewed and, on the basis of their similarities, were clas- 
sified into three general categories. The characteristics of these categories 
were described, and the general Rasch model was selected for further study on 
the basis of ease of parameter estimation. Third, estimation procedures for the 
parameters of the general Rasch model were described and applied to a set of 
simulation data that had been generated according to a two-dimensional special 
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case of the model. The results indicated that a very close correspondence had 
been obtained between the estimated item parameters and those used to generate 
the simulation data. 

On the basis of this information, two conclusions can be drawn. First, the 
concepts of difficulty and discrimination can be generalized to the multidimen- 
sional case, but the results are slightly different for compensatory and noncom- 
pensatory models. For the compensatory models, the item difficulty is defined 
by a linear function of the ability dimensions, while for the noncompensatory 
models, difficulty is defined by a vector of difficulty parameters. In both 
cases the slope of the item response surface at the difficulty function is a 
function of the discrimination parameters of the model. 

A second conclusion that can be drawn is that the parameters of the general 
Rasch model can be estimated with acceptable accuracy for the simple two-dimen- 
sional case presented in the paper. This is, of course* very minimal evidence 
for the value of the estimation procedures; but combined with the work of Bock 
and Aitkin (1981), the results look fairly promising. 

The results presented in this paper summarize only the initial steps in a 
thorough study of the applicability and usefulness of multidimensional latent 
trait models. Much further work needs to be done. The sample size requirements 
and the characteristics of the estimates obtained from the estimation procedures 
need to be determined. Procedures for determining the number of dimensions re- 
quired for the multidimensional latent space must be developed as well as guide- 
lines for interpreting the dimensions. The usefulness of the models for real 
data applications such as test construction and adaptive testing should be in- 
vestigated. Also the advantages of these procedures over existing multidimen- 
sional procedures, such as factor analysis, should be studied. 
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